skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Search for: All records

Creators/Authors contains: "Sam, M"

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

  1. Abstract Blue carbon ecosystems such as seagrass meadows, mangrove forests, and salt marshes are important carbon sinks that can store carbon for millennia. Recently, organic matter sulfurization and pyritization have been proposed as mechanisms of net carbon storage in blue carbon ecosystems. At our study site, organic sulfur that is resistant to acid hydrolysis (protokerogen) is an order of magnitude less abundant than pyrite sulfur, suggesting a dominance of pyritization over sulfurization. The C/N ratios and carbon isotope compositions suggest that nearly half of total organic carbon and ≥ 80% of protokerogen is composed of marsh plant material. Sediment protokerogen appears to be sulfurized based on its low δ34S values (− 10‰), abundance of disulfides, and higher S/C ratio (~ 1.0%) relative to potential biogenic sulfur sources. However, the interpretation of protokerogen δ34S values is complicated by the wide range in sulfur isotope compositions of marsh plants. Evidence for sulfurization occurs within the shallowest sediments across different vegetation zones, yielding consistent products, while pyritization appears to be more sensitive to alterations in sediment redox conditions. Based on organic sulfur and pyrite content, sulfurization may be a more spatially consistent process than pyritization, with implications for carbon storage. The relative abundance of pyrite and protokerogen organic sulfur indicates that pyritization is favored at our study site, but this is likely to vary across the spectrum of blue carbon ecosystems. 
    more » « less
    Free, publicly-accessible full text available July 1, 2026
  2. BackgroundRisk-based screening for lung cancer is currently being considered in several countries; however, the optimal approach to determine eligibility remains unclear. Ensemble machine learning could support the development of highly parsimonious prediction models that maintain the performance of more complex models while maximising simplicity and generalisability, supporting the widespread adoption of personalised screening. In this work, we aimed to develop and validate ensemble machine learning models to determine eligibility for risk-based lung cancer screening. Methods and findingsFor model development, we used data from 216,714 ever-smokers recruited between 2006 and 2010 to the UK Biobank prospective cohort and 26,616 high-risk ever-smokers recruited between 2002 and 2004 to the control arm of the US National Lung Screening (NLST) randomised controlled trial. The NLST trial randomised high-risk smokers from 33 US centres with at least a 30 pack-year smoking history and fewer than 15 quit-years to annual CT or chest radiography screening for lung cancer. We externally validated our models among 49,593 participants in the chest radiography arm and all 80,659 ever-smoking participants in the US Prostate, Lung, Colorectal and Ovarian (PLCO) Screening Trial. The PLCO trial, recruiting from 1993 to 2001, analysed the impact of chest radiography or no chest radiography for lung cancer screening. We primarily validated in the PLCO chest radiography arm such that we could benchmark against comparator models developed within the PLCO control arm. Models were developed to predict the risk of 2 outcomes within 5 years from baseline: diagnosis of lung cancer and death from lung cancer. We assessed model discrimination (area under the receiver operating curve, AUC), calibration (calibration curves and expected/observed ratio), overall performance (Brier scores), and net benefit with decision curve analysis.Models predicting lung cancer death (UCL-D) and incidence (UCL-I) using 3 variables—age, smoking duration, and pack-years—achieved or exceeded parity in discrimination, overall performance, and net benefit with comparators currently in use, despite requiring only one-quarter of the predictors. In external validation in the PLCO trial, UCL-D had an AUC of 0.803 (95% CI: 0.783, 0.824) and was well calibrated with an expected/observed (E/O) ratio of 1.05 (95% CI: 0.95, 1.19). UCL-I had an AUC of 0.787 (95% CI: 0.771, 0.802), an E/O ratio of 1.0 (95% CI: 0.92, 1.07). The sensitivity of UCL-D was 85.5% and UCL-I was 83.9%, at 5-year risk thresholds of 0.68% and 1.17%, respectively, 7.9% and 6.2% higher than the USPSTF-2021 criteria at the same specificity. The main limitation of this study is that the models have not been validated outside of UK and US cohorts. ConclusionsWe present parsimonious ensemble machine learning models to predict the risk of lung cancer in ever-smokers, demonstrating a novel approach that could simplify the implementation of risk-based lung cancer screening in multiple settings. 
    more » « less
  3. An extension to the wave packet description of quantum plasmas is presented, where the wave packet can be elongated in arbitrary directions. A generalized Ewald summation is constructed for the wave packet models accounting for long-range Coulomb interactions and fermionic effects are approximated by purpose-built Pauli potentials, self-consistent with the wave packets used. We demonstrate its numerical implementation with good parallel support and close to linear scaling in particle number, used for comparisons with the more common wave packet employing isotropic states. Ground state and thermal properties are compared between the models with differences occurring primarily in the electronic subsystem. Especially, the electrical conductivity of dense hydrogen is investigated where a 15% increase in DC conductivity can be seen in our wave packet model compared with other models. This article is part of the theme issue ‘Dynamic and transient processes in warm dense matter’. 
    more » « less
  4. ABSTRACT The local distance ladder estimate of the Hubble constant (H0) is important in cosmology, given the recent tension with the early universe inference. We estimate H0 from the Type Ia supernova (SN Ia) distance ladder, inferring SN Ia distances with the hierarchical Bayesian SED model, BayeSN. This method has a notable advantage of being able to continuously model the optical and near-infrared (NIR) SN Ia light curves simultaneously. We use two independent distance indicators, Cepheids or the tip of the red giant branch (TRGB), to calibrate a Hubble-flow sample of 67 SNe Ia with optical and NIR data. We estimate H0 = 74.82 ± 0.97 (stat) $$\pm \, 0.84$$ (sys) km $${\rm s}^{-1}\, {\rm Mpc}^{-1}$$ when using the calibration with Cepheid distances to 37 host galaxies of 41 SNe Ia, and 70.92 ± 1.14 (stat) $$\pm \, 1.49$$ (sys) km $${\rm s}^{-1}\, {\rm Mpc}^{-1}$$ when using the calibration with TRGB distances to 15 host galaxies of 18 SNe Ia. For both methods, we find a low intrinsic scatter σint ≲ 0.1 mag. We test various selection criteria and do not find significant shifts in the estimate of H0. Simultaneous modelling of the optical and NIR yields up to ∼15  per cent reduction in H0 uncertainty compared to the equivalent optical-only cases. With improvements expected in other rungs of the distance ladder, leveraging joint optical-NIR SN Ia data can be critical to reducing the H0 error budget. 
    more » « less
  5. Abstract We review observations of solar activity, geomagnetic variation, and auroral visibility for the extreme geomagnetic storm on 1872 February 4. The extreme storm (referred to here as the Chapman–Silverman storm) apparently originated from a complex active region of moderate area (≈ 500μsh) that was favorably situated near disk center (S19° E05°). There is circumstantial evidence for an eruption from this region at 9–10 UT on 1872 February 3, based on the location, complexity, and evolution of the region, and on reports of prominence activations, which yields a plausible transit time of ≈29 hr to Earth. Magnetograms show that the storm began with a sudden commencement at ≈14:27 UT and allow a minimum Dst estimate of ≤ −834 nT. Overhead aurorae were credibly reported at Jacobabad (British India) and Shanghai (China), both at 19.°9 in magnetic latitude (MLAT) and 24.°2 in invariant latitude (ILAT). Auroral visibility was reported from 13 locations with MLAT below ∣20∣° for the 1872 storm (ranging from ∣10.°0∣–∣19.°9∣ MLAT) versus one each for the 1859 storm (∣17.°3∣ MLAT) and the 1921 storm (∣16.°2∣ MLAT). The auroral extension and conservative storm intensity indicate a magnetic storm of comparable strength to the extreme storms of 1859 September (25.°1 ± 0.°5 ILAT and −949 ± 31 nT) and 1921 May (27.°1 ILAT and −907 ± 132 nT), which places the 1872 storm among the three largest magnetic storms yet observed. 
    more » « less
  6. Abstract We present Young Supernova Experimentgrizyphotometry of SN 2021hpr, the third Type Ia supernova sibling to explode in the Cepheid calibrator galaxy, NGC 3147. Siblings are useful for improving SN-host distance estimates and investigating their contributions toward the SN Ia intrinsic scatter (post-standardization residual scatter in distance estimates). We thus develop a principled Bayesian framework for analyzing SN Ia siblings. At its core is the cosmology-independent relative intrinsic scatter parameter,σRel: the dispersion of siblings distance estimates relative to one another within a galaxy. It quantifies the contribution toward the total intrinsic scatter,σ0, from within-galaxy variations about the siblings’ common properties. It also affects the combined distance uncertainty. We present analytic formulae for computing aσRelposterior from individual siblings distances (estimated using any SN model). Applying a newly trainedBayeSNmodel, we fit the light curves of each sibling in NGC 3147 individually, to yield consistent distance estimates. However, the wideσRelposterior meansσRel≈σ0is not ruled out. We thus combine the distances by marginalizing overσRelwith an informative prior:σRel∼U(0,σ0). Simultaneously fitting the trio’s light curves improves constraints on distanceandeach sibling’s individual dust parameters, compared to individual fits. Higher correlation also tightens dust parameter constraints. Therefore,σRelmarginalization yields robust estimates of siblings distances for cosmology, as well as dust parameters for sibling–host correlation studies. Incorporating NGC 3147's Cepheid distance yieldsH0= 78.4 ± 6.5 km s−1Mpc−1. Our work motivates analyses of homogeneous siblings samples, to constrainσReland its SN-model dependence. 
    more » « less
  7. The LAGOS-US RESERVOIR data module (hereafter, RESERVOIR) classifies all 137,465 lakes > 4 hectares in the conterminous U.S. into one of the following three categories using a machine-learning predictive model based on visual interpretation of lake outlines and a classification rule based on lake shape. Natural Lakes (NLs) are defined as lakes that are likely to be entirely or mostly naturally-formed and that do not have large, flow-altering structures on or near them; Reservoir Class A’s (RSVR_A) are defined as lakes that are likely to be either human-made or highly human-altered by the presence of a relatively large water control structure that appears to significantly change the flow of water; and Reservoir Class B’s (RSVR_Bs) are lakes that are likely to be entirely human-made based on isolation from rivers and a highly angular shape that is rarely, if ever, seen in natural lakes also often. We trained the machine learning models on 12,162 manually-classified lakes to assign probabilities of a lake being in 1 of 2 of the categories (NL or RSVR), then we further classified the RSVR classification into either A or B based on NHD Fcodes, isolation, and angularity. The data module includes a detailed User Guide, metadata tables, and a data table that includes information such as location, lake geometry, surface water connectivity class, and official name. Using our definition, our classification indicates that over 46 % of lakes > 4 ha in the conterminous U.S. are reservoir lakes. These data can be combined with other LAGOS-US data modules and U.S. national databases using unique lake identifiers to study both reservoir lakes and natural lakes at broad scales. 
    more » « less
  8. null (Ed.)
    Abstract Background Water resources development promotes agricultural expansion and food security. But are these benefits offset by increased infectious disease risk? Dam construction on the Senegal River in 1986 was followed by agricultural expansion and increased transmission of human schistosomes. Yet the mechanisms linking these two processes at the individual and household levels remain unclear. We investigated the association between household land use and schistosome infection in children. Methods We analyzed cross-sectional household survey data ( n  = 655) collected in 16 rural villages in August 2016  across demographic, socio-economic and land use dimensions, which were matched to Schistosoma haematobium ( n  = 1232) and S. mansoni ( n  = 1222) infection data collected from school-aged children. Mixed effects regression determined the relationship between irrigated area and schistosome infection presence and intensity. Results Controlling for socio-economic and demographic risk factors, irrigated area cultivated by a household was associated with an increase in the presence of S. haematobium infection (odds ratio [ OR ] = 1.14; 95% confidence interval [95% CI ]: 1.03–1.28) but not S. mansoni infection ( OR  = 1.02; 95% CI : 0.93–1.11). Associations between infection intensity and irrigated area were positive but imprecise ( S. haematobium: rate ratio [ RR ] = 1.05; 95% CI : 0.98–1.13, S. mansoni : RR  = 1.09; 95% CI : 0.89–1.32). Conclusions Household engagement in irrigated agriculture increases individual risk of S. haematobium but not S. mansoni infection. Increased contact with irrigated landscapes likely drives exposure, with greater impacts on households relying on agricultural livelihoods. 
    more » « less
  9. Lamberton, Poppy H. (Ed.)
    Background Infectious disease risk is driven by three interrelated components: exposure, hazard, and vulnerability. For schistosomiasis, exposure occurs through contact with water, which is often tied to daily activities. Water contact, however, does not imply risk unless the environmental hazard of snails and parasites is also present in the water. By increasing reliance on hazardous activities and environments, socio-economic vulnerability can hinder reductions in exposure to a hazard. We aimed to quantify the contributions of exposure, hazard, and vulnerability to the presence and intensity of Schistosoma haematobium re-infection. Methodology/Principal findings In 13 villages along the Senegal River, we collected parasitological data from 821 school-aged children, survey data from 411 households where those children resided, and ecological data from all 24 village water access sites. We fit mixed-effects logistic and negative binomial regressions with indices of exposure, hazard, and vulnerability as explanatory variables of Schistosoma haematobium presence and intensity, respectively, controlling for demographic variables. Using multi-model inference to calculate the relative importance of each component of risk, we found that hazard (Ʃw i = 0.95) was the most important component of S . haematobium presence, followed by vulnerability (Ʃw i = 0.91). Exposure (Ʃw i = 1.00) was the most important component of S . haematobium intensity, followed by hazard (Ʃw i = 0.77). Model averaging quantified associations between each infection outcome and indices of exposure, hazard, and vulnerability, revealing a positive association between hazard and infection presence (OR = 1.49, 95% CI 1.12, 1.97), and a positive association between exposure and infection intensity (RR 2.59–3.86, depending on the category; all 95% CIs above 1) Conclusions/Significance Our findings underscore the linkages between social (exposure and vulnerability) and environmental (hazard) processes in the acquisition and accumulation of S . haematobium infection. This approach highlights the importance of implementing both social and environmental interventions to complement mass drug administration. 
    more » « less